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ABSTRACT 


The  occurrence  of  independent  events  at  random  In  the 
plane,  i.e.  the  formation  of  a  planar  point  process,  is 
discussed.  Both  homogeneous  and  nonhcmogcneou3  processes 
are  considered.  A  specific  functional  form  for  the  parameter 
in  a  nonhomogeneous  planar  Poisson  process  is  used  to 
illustrate  the  development  of  test  and  parameter  estimation 
techniques.  The  problem  finds  application  in  the  description 
of  biological  phenomena  as  well  as  In  search  and  detection 
problems. 
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I.  INTRODUCTION 


Many  problems  arising  naturally  in  a  physical  sense  are 
often  so  complex  that  the  identification  and  description  of 
underlying  mechanisms  must  use  the  tools  of  probability  and 
statistics.  Some  of  the  reasons  leading  to  the  requirement 
of  using  these  tools  are: 

(i)  the  data  base  may  be  so  large  or  complex  as  to 
preclude  identification  of  any  driving  mechanism 
without  recourse  to  statistical  analysis; 

(ii)  if  identifiable,  the  mechanisms  may  be  inherently 
probabilistic;  or 

(iii)  if  identifiable  and  deterministic,  the  governing 
law  which  the  mechanisms  obey  may  be  unknown. 

This  paper  is  concerned  with  the  use  of  statistics  in 
the  identification  and  mathematical  description  of  the  spatial 
distribution  of  events  (occurrences).  Included  is  the  detec¬ 
tion  and  estimation  of  parameters  which  influence  the 
description  of  this  distribution. 

The  area  of  concern  here  is  a  departure  from  those  sta¬ 
tistical  methods  which  have  been  developed  to  detect  the 
effect  of  varying  a  controlled  segment  of  the  underlying 
mechanism.  Among  those  methods  would  be  the  design  of  exper¬ 
iments,  regression  analysis,  time  series  analysis,  and 
analysis  of  variance.  One  goal  of  such  analysis  is  to 
hopefully  predict  the  advisibility  of  pursuing  some  course 
of  action. 
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In  the  basic  model  of  l:  -per,  events  are  considered 

to  occur  with  a  Poisson  distribution  in  the  plane.  This 
"is  the  natural  model  for  the  _\pression  that  ’points  are 
distributed  at  random',"  [P-shor,  1972,  p.  1^1].  The  bi¬ 
variate  Poisson  process  will  bo  defined  and  then  developed 
through  the  use  of  partial  differential-difference  equations, 
a  widely  repeated  procedure  ir.  the  univariate  case  but 
neglected  in  the  bivariate  case. 

Initially  a  homogeneous  Poisson  process  will  be  assumed 
to  control  the  underlying  mechanisms.  Then  trends  will  be 
introduced  by  defining  the  Poisson  parameter  in  such  a  way 
as  to  make  it  be  spatially  dependent.  This  will  be  the  basis 
for  the  definition  of  the  non-homogeneous  Poisson  process. 

Time  inhomogeneity  will  not  be  considered.  Thus,  the  data 
are  assumed  to  be  taken  concurrently,  i.e.,  the  period  of 
observation  is  short  compared  to  any  period  of  change  of 
the  parameters. 

Tests  will  be  developed  to  distinguish  between  homogeneity 
and  non-homogeneity  and  the  method  of  maximum  likelihood 
will  be  used  to  develop  estimates  of  the  parameter  in  the 
homogeneous  case  and  parameters  in  the  non-homogeneous  case. 

In  the  latter  case,  conditional  likelihood  techniques  will 
be  utilized  to  develop  tests  and  estimates.  Throughout, 
testing  and  estimation  procedures  will  be  based  on  a  single 
realization  of  the  process  which  consists  of  the  number  of 
events  observed  and  their  spatial  locations. 
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The  problem  of  concern  finds  application  in  the  estima¬ 
tion  of  the  density  of  trees  in  a  forest;  here  one  might  be 
concerned  with  estimating  the  potential  yield  of  lumber  from 
a  given  forest  area  where  inhomogeneities  arise  due  to  soil, 
weather  patterns,  topography  and  other  physical  reasons. 

Another  application  might  be  in  naval  search  and  detec¬ 
tion  problems.  For  example,  one  might  be  searching  for  a 
merchant  ship  in  distress  whose  location  is  not  known  exactly 
due  to  failure  of  the  ship's  communication  equipment.  Here 
the  independence  assumptions  of  the  planar  Poisson  process 
may  be  valid,  but  not  the  assumption  of  homogeneity.  In¬ 
homogeneities  of  location  occur  because  of  preferred  sea 
lanes  and  physical  characteristics  of  the  ocean  and 
atmosphere. 
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II.  THE  HOMOGENEOUS  POISSON  PROCESS  IN  THE  PLANE  (HPPP) 


A.  GENERAL  DEVELOPMENT 

Consider  a  stochastic  process  of  events  occurring  in 
the  plane  (i.e.,  a  so-called  point  process)  which  is 
characterized  by  the  assumptions 

I.  There  exists  a  finite  positive  constant  X  >  0. 

II.  For  any  Integer  k  >_  1  and  any  set  of  non-overlapping 

regions  Rk  with  areas  (in  the 

usual  geometric  sense),  the  number  of  events  occurring 
in  any  region  i,  denoted  N(R, ),  has  a  Poisson  dis¬ 
tribution  with  parameter  XAi  which  depends  only  on 
the  area  of  the  region,  ,  and  not  its  shape.  Thus, 

ni 

(XA,  )  1exp(-XA. ) 

prob  (N(Ri)  *  n1>  =  - pj— j -  .  (1) 

III.  Further,  N ( Ri ) ,  i  =  l,2,’**,k,  are  mutually  indepen¬ 
dent  in  that  N(R^)  is  not  affected  by  the  occurrence 
of  events  in  any  other  region  or  in  any  grouping  of 
the  regions,  G,  as  long  as  RjHg  =  0.  Thus 

n.  -XA. 
k  (XA. )  xe  x 

prob(N(R,  )=n . ,  i=l,***,k}=  II  - « -  (2) 

1  1  i=l  ni‘ 

Definition  1:  If  a  process  obeys  the  above  assumptions  it  is 
called  a  homogeneous  planar  Poisson  process  (HPPP). 

For  reasons  of  arbitrary  shape  the  above  basic  definitions 
will  suffice.  However,  under  certain  geometrical  assumptions,  an 
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equivalent  definition  for  the  HPPP  can  bo  achieved  in  a  man¬ 
ner  similar  to  the  development  of  the  univariate  Poisson 
process  through  the  use  of  partial  differentlal-dif ference 
equations.  This  is  useful  for  the  development  of  statisti¬ 
cal  properties  and  will  be  very  important  in  the  development 
of  the  non-homogeneous  process.  Such  a  development  also 
provides  another  phenomenological  approach  to  the  homogeneous 
Poisson  process,  one  which  might  arise  through  the  struc¬ 
turing  of  a  model  for  instance.  For  illustrative  purposes 
the  following  development  will  be  accomplished  using  rectan¬ 
gular  regions.  Note  that  the  development  is  very  dependent 
on  the  geometry  involved;  hence  developments  with  other 
geometries  (e.g.  circular  regions)  must  proceed  somewhat 
differently. 

The  underlying  assumptions  in  the  differential  equation 
development  will  be 

I'.  There  exists  a  finite  positive  constant  X  >  0. 

II'.  For  any  region  R*  with  incremental  area  AA,  inde¬ 
pendent  of  the  shape  of  the  region  except  possibly 
as  noted  above 

(a)  prob  {no  event  In  R*}  *  1  -  XAA  +  o(AA), 

(b)  prob  {one  event  In  R*}  *  XAA  +  o(AA), 

(c)  prob  {more  than  one  event  in  R*}  e  o(AA), 

where  ”g(AA)  is  o(AA),?  means  11m  *  o,  or 

AA+0 

specifically  in  rectangular  regions  the  limit  as  Ax 
or  Ay  or  both  go  to  zero  of  frvf:"  is  zero. 
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III'.  The  occurrence  of  events  in  rt*  is  independent 
of  the  occurrence  of  events  in  any  region  R+ 
where  RKflR1  *  0. 

It  will  he  shown  that  I',  II  and  III'  imply  and  are 
implied  by  I,  II  and  III  so  that  ohe  two  sets  of  assumptions 
are  equivalent  and  hence  the  incremental  assumptions  give 
rise  to  a  HPPP.  Clearly  I  and  I'  are  the  same, as  are  III 
and  III'.  Also  II  implies  II'  since  by  (1) 

2 

(a)  prob  {N(R*)  =  0}  =  e_XAA  =  1  -  XAA  +  (AA)2  -  ... 

2 

=  1  -  XAxAy  +  Ax2Ay2  -  ... 

=  1  -  XAA  +  o(AA) , 

with  the  definition  of  o(AA)  given  above.  Also 

(b)  prob  {N(R* )  =  1}  *  XAAe"XAA  =  XAA(1  -  XAA  +  ...) 

I 

*  XAA  +  o(AA) 

00  -XAA 

and  (c)  prob  {N(R«)  >  2}  =  Z  K  ■ ■’  -  =  o(AA). 

i*=2  * 

The  problem  remaining  in  order  to  demonstrate  equivalence 
between  the  two  sets  of  assumptions  is  to  show  that  II' 
implies  II. 

Consider  a  region  R  bounded  by  the  co-ordinate  axes  and 
lines  x  =  X*  and  y  =  Y*,  with  area  X*Y*.  Now  extend  the 
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sides  to  x  B  X*+Ax  and  y  »  Y*+Ay  (see  Figure  1).  Consider 
the  probability  of  n  events  occurring  in  the  extended 
region,  R'  »  R  +  R1  +  R2  +  R^»  where: 

(a)  R  has  area  X*Y* , 

(b)  R^  has  area  X*Ay, 

(c)  R2  has  area  Y*Ax, 

(d)  R^  has  area  AxAy; 

(a)-(d)  imply  R*  has  area  X*Y*  +  X*Ay  +  Y*Ax  +  AxAy. 
The  assumptions  I',  II’,  and  III'  imply 


prob  (no  event  in  R^}  c  1  -  XX* Ay  +  o(X*Ay), 

prob  {one  event  in  R^}  B  XX*Ay  +  o(X*Ay),  (3) 

prob  {more  than  one  event  in  R^}  =  o(X*Ay); 

prob  {no  event  in  R2 }  *  1  -  XY*Ax  +  o(Y*Ax), 

prob  {one  event  in  R2)  *  XY*Ax  +  o(Y*Ax),  (M 

prob  {more  than  one  event  in  Rj}  ®  o(Y*Ax); 

and 

prob  {no  event  in  R,}  c  1  -  XAxAy  +  o(AxAy), 

prob  {one  event  in  R^}  e  XAxAy  +  o(AxAy),  (5) 

prob  {more  than  one  event  in  R^}  *  o(AxAy). 

Moreover,  statements  ( 3 ) ,  (*0,  and  (5)  are  independent. 

It  is  noted  that  the  above  equations  may  have  two 
different  interpretations.  For  instance  in  (3),  prob{one 
event  in  R^}  ®  X*Ay  +  o(X*Ay)  is  interpreted  to  mean  one  event 
in  a  two-dimensional  process  with  parameter  X  and  area  X*Ay. 
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ly 


Y*+Ay 

Y* 


i 


Figure  1.  The  incremental  increase  of  a 
i  rectangular  region. 
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Kowcv*  r,  another  interpretation  would  be  to  consider  the  one- 
diriM’.nional (marginal)  process  of  events  projected  onto  the 
.•-axis,  in  v;hich  case  the  parameter  is  XX*  and  the  incremental 
interval  has  length  Ay. 

For  notational  convenience,  let  P  (X*,Y*)  denote  the 
probability  that  n  events  occur  in  a  region  with  area  X*Y*. 

The  differential-difference  equations  are  written  noting  that 
n  events  may  occur  in  an  extended  region  by  having  n  events 
in  the  unextcnded  region  and  no  events  in  the  extension, 
n-1  events  in  the  unextended  region  and  one  event  in  the 
extension,  etc.  Hence 


Pn(X«+Ax,Y«)  -  Pn(X«,Y«)  •  P0(Ax,Y») 

♦  Pn-10(«,Y«)  •  P^  ( Ax ,  Yfc ) 

♦  Pn_2(X»,Yf)  •  P2(Ax,Y«)  +  ... 

■  P  (X* ,Y*)[1-XY* Ax j  ♦  P„  , ( X*Y* ) [XY*Ax ]  +  o(Y«Ax). 
n  n-i 

(6) 


Similarly, 


Pn(X#,Y*+Ay)  -  Pn(X«fY*»)[l-AX«Ay]  *  Pn_1(XB  ,Y*)[XXiAy3  ♦ 
♦  o(XfAy ) ,  (7) 

and 
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Pn(X»+£x,Y#+Ay)  -  P^ ( X* ,Y* ) [l-XYHAx] [ l-XX* Ay ] [ 1-XAxAy ] 

+  Pn-i(X»,y«)[XY»Ax(l-XX«Ay)(l-XAxAy) 

+  XX*Ay  ( l-XY* Ax )  ( l-XA:<Ay ) 

+  XAxAy(l-XX*Ay )  (l-XYRAx) ] 

+  Pn_2(X»,Y»)[XX«Ay  •  XY«Ax(l-XAxAy) 

+  XX^Ayd-XY^AxUAxAy 
+  ( l-XX*Ay ) XY*AxXAxAy ] 

+  Pn_3(X»,Y«)[X3X«Y»Ax2Ay2]  (8) 

+  o(AxAy)  +  o(X*Ay)  +  o(Y*Ax). 

Interpreting  the  above  equations,  the  third  tern  on  the 
right  hand  side  of  (8),  for  example,  states  that  there  can 
be  n  events  in  the  extended  region  P.'  if  there  are  n-2 
events  in  R  and  exactly  one  event  In  each  of  any  two  of  the 
added  regions.  That  is,  there  can  be  two  events  in  the 
added  regions  R^,  R^  and  if  one  occurs  in  each  of  two 
regions  and  none  occurs  in  the  third  region,  i.e.,  one  in 
,  one  in  Rj  and  none  in  R^,  etc.  Collecting  all  terns  of 
order  o(AxAy),  o(X#Ay)  and  o(Y"Ax),  (8)  reduces  to 
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Pn(X«+M)Y'+«)‘>VCX^Y*U1-W*iX'XX,W‘UXM  +  X2X,i,'iXlyl 

+  p  (x»,Y*)[XY«Ax-2X?X*Y«AxAy+XX«Ay+XAxAy] 

n-1 

,  P  (X«, YM[X?X»Y«AxAy]+o(AxAy)+o(X«Ay)+°(V  Ax).  ^ 

n-2 

.  P  (X* ,Y* ) C 1-XY* Ax ]+Pn_^ ( X*  ,Y*) [ XY* Ax]+o(Y* Ax ) 

n 

♦  p  (x»,Y*)[i-xx*Ay3*i,n.i(1',>!t*Uxx''''y3+0<X,Ay) 

n 

-  Pn(x»,Y»)-XPn(X",YMaxiy+XPn_1(X*,Y»>AxAy 

t  ,W[fn(Y*.Y*)^,-1<*,.Y,>*Pn-2(X,-V,)1“MW(“W)' 

Kotlne  froa  equation.  (6)  an.  (7)  that  the  first  three 

terns  on  the  right-hand  side  of  the  above  equation  are 

p  (X.4M,Y*)  and  the  next  three  are 
n 

rewriting  (8').  the  result  Is 


P  (X*+Ax 
n 


,Y.tay).?n(x>hax,Y*)+Pntx*,Y'*iy:-Pn<x,.Y,) 

-  xPn(x,,Y*)axay*xPn.1tx<',,,)ix'1Y  (8  ’ 

♦  X2X«Y*[Pn(X*  ,Y*)-JPn.i(x'  .Y*>+Pn-2(X*  •Y*llxw 

♦  o(AxAy). 

The  definition  of  the  second  partial  derivative  with 
respect  to  two  variables  is 


By 1 3x 


J"]  im  0  im 
A y-0A'  Ax-0 


F(x+a>  >:,  ■*  ‘ v )-?■'( x  ,y+Ay) 
Ax 


.  lim  ?:.C:<±A>_1jL).-KiL.yl) 

.  Ax  ' 

Ax-J 


Hence,  transposing  ’he  first  three  terms  of  equation  (8") 
to  the  right  hand  side,  dividing  by  AxAy  and  then  taking 
the  double  limit  results  in 


a?Pn(x«,y») 

3x3y 


-XFn(X*,Y»)  +  Xrn_1(X»,y«)  +  A?X»Y»[Pn(X,‘,Y«) 
-2Pn-1(X«,Y»)+Pn_2(X»,Y«)]  (9) 


The  solution  to  (a  partial  differential-difference 

equation)  is  easily  shown  to  be 


P  (X»,Y»)  -  K(  AX»Y«)noxp(-XX»Y«)  ,  n  -  £,3, -  (10a) 

n  n! 

where  K  is  an  arbitrary  constant.  Special  considerations 
are  needed  for  n  B  0,1  since  for  these  cases  some  of  the 
terms  in  (8")  and  (9)  are  not  defined.  Rewriting  (8")  and 
(9)  while  concurrently  eliminating  the  proper  terms  leads  to 


Pn(X\Y») 


X ( XXiYi )nexp(-XXfY* ) 

- nd  - 


n  ■  0,1,2,.  . .  (10b) 


Since  ?n(X*,Y*)  is  a  probability  statement  and  for  any 
given  region  the  number  of  events  in  that  region  must  be 
some  non-negative  integer,  the  constant  K  is  seen  to  be  unity. 
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Hence  (10b)  is  equivalent  to  (1)  which  was  to  he  proven. 

Thun  the  two  nets  of  acnurr.pt ionn  imply  the  name  thir  , 
namely  that  the  number  of  events  in  a  region  hac  c.  pr»  won 
distribution  with  parameter  proportional  to  the  arc  c  ©'  the 
region  and  independent  of  its  shape  and  the  number  •  r:-i 
position  of  events  outside  the  region.  Note  that  toe 
formulation  excludes  multiple  events,  i.e.,  the  oncu •  '.ce 
of  two  or  more  events  at  any  point  or  on  any  line  :  nir.„''e 

added  region  such  as  R^^  in  Figure  1. 

Also  a  similar  derivation  will  go  through  for  <  Cellar 
regions  using  polar  co-ordinates,  but  there  are  d’_:  r.c'"- 

in  the  special  properties  of  the  Poisson  process  ar  fined 
through  assumptions  I,  II  and  III  in  differently  si  :; 
regions.  These  are  discussed  below.  The  dlffcro*  <  es 
the  special  properties  of  the  non-homogeneous  plan:  i  r  r  .  r . 

processes  as  they  vary  with  different  geometries  :  . 

essential  element  of  the  analysis  of  points  (evens' 
the  plane. 

B.  TESTING  DATA  FOR  HOMOGENEOUS  PLANAR  POISSON  PrtOCKSS  (HF'PP) 

Given  the  occurrence  and  spatial  location  of  n  events  In 
a  rectangular  region  of  area  X*Y*,  consider  the  problem  of 
determining  whether  or  not  these  points  occur  as  realisations 
of  the  HPPP.  Miles  [1970, p. 89]  has  stated  a  consequence  of 
definition  1  as 

Corollary.  Assume  a  rectangular  region  R^  with  area  A^. 

Given  N(R^)  ■  r.  and  0  <  A^  <  ®,  the  n  points  are  independently 
and  uniformly  distributed  in  R^. 
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in  a  rectangular  region 


Proof:  Let  A^  *  X#Y#  where 
bounded  by  the  coordinate-  axes,  x  ^  X#  and  y  «=  Y*.  Label 
the  n  given  points  in  ;  ny  conv'*n!  or.t  tanner,  e.g.,  on  the 
magnitude  of  the  y-ccmponent .  Let  (x,y)^j  denote  the  ith 
labelled  evc-nt.  Consider  an  incremental  region  with  area 
dxdy  which  has  the  prop  .1;  ol;  {exactly  one  event  in  the 

incremental  region  of  :  .  ;  »-  P^(dx,dy)  *  Xdxdy  +  o(dxdy). 

Consider  now  n  incro::..  ;>s  ax^ly^  i  c  l>,**,n, 

placed  in  Ri<  Ignor-1 r.r  :  : ’.-•fifties  of  o(dxidy1),  assump¬ 
tions  I,  IT  and  III  ::  ’  ’hat  ,hc-  joint  probability  that 

the  ith  event  falls  ir.  the  J  nc:  -- ment  al  rectangle,  dx^dyj^, 
i  ■  l,”  •  ,n,  and  exactly  n  even.  s  occur  altogether  in  X*Y* 
is  given  by 

Adx-^dy^. . .  Xdx  <  { .  \x#Y# ) . 

Restating  in  terms  o:  +  t<  deaaty  function, 

fUx.yJ^j,.. .  ,(x,y)^pj  ,n;X}  *=  Xnexp{-XX*Y*} , 

where  f(...}  is  the  Joint  density  of  (x,y)(^,  i  *  l,...,n, 

and  the  probability  that  the  number  of  events  in  X*Y*  is  n. 

The  exponential  term  In  the  above  expressions  is  an  approx- 

n 

imation  to  exp(-(XX*Y*  -  Z  Xdx.dy.H,  i.e.,  represents 

i  =  l  1  1 

the  probability  of  no  events  within  the  region  X*Y*  but 
outside  the  incremental  regions  containing  each  event. 

By  conditioning  on  the  occurrence  of  n  events  in  the 
region  which  are  distributed  Poisson  with  parameter  XX#Y*, 


T 


.  ,  ,  .x>  .  - .  n  >  1 

f{(:'.,y)(1)»--*(x»y)(n)  •“»  (XX*Y*)ncxp(-X>:*Y‘l) 


n! 


n! 


(X*Y*) 


n 


(11) 


w!,ich  1,  the  Joint  distribution  for  n  bivariate  uniform 
random  variables  ordered  on  one  of  the  random  variables  as 
is  shown  in  Appendix  A.  Note  also  the  independence  of  the 
conditioned  density  from  the  parameter  X,  i.e.  the  random 

variable  N  is  a  sufficient  statistic  for  X. 

AS  a  consequence  of  the  above  corollary,  it  is  apparent 
that  if  the  points  of  the  HPPV,  conditioned  on  the  number 
Of  events  observed  to  occur,  are  in  fact  ordered  with 
respect  to  the  increasing  magnitude  of  the  y-eomponent,  then 
no  "information"  is  available  about  the  ordering  of  the  x- 
components,  i.e..  each  of  the  nl  ordering  that  can  be 
induced  on  the  x's  by  the  orderings  on  the  y's  has  probabl  Ity 
1/n!.  This  is  readily  apparent  since  in  the  bivariate  uni  irm 
ease  the  two  components  were  Independently  selected.  Hone  , 

If  <x,y)(k)  is  determined  by  (*k.y<k)>.  1>e-  thc  polntS  ar<! 
labelled  by  the  ordered  y-component,  then 


prob(Xk  -  X(J)>  ■  „ 


1  J  -  1»2,. .  •  »n 


where 


X,  \  is  the  jth  \  m  magnitude,  and 

(J )  * 


probtXj  -  X(J).  . .  V*0 «)•  k-1 . 

v  v  1  -  L-  <«) 

Xn  "  XUr  n! 
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Hence  if  the  x-components  of  the  pointr.  ordered  on  the 
y-components  exhibit  any  natural  ordering  then  the  x-  and 
y-components  have  not  been  independently  selected  and  the 
observed  process  cannot  be  a  HPPP.  This  will  be-  the  basis 
for  many  of  the  tests  for  a  HPPP  against  a  non-homogeneous 
planar  Poisson  process  to  be  discussed  later. 

Lemma :  If  the  bivariate  process  is  Poisson  and  the  regions 
are  rectangular,  then  the  projections  of  the  events  onto 
the  coordinate  axes  may  be  shown  to  be  univariate  Poisson. 


Proof:  Consider  the  occurrence  of  events  in  a  rectangular 
region  of  area  X*Y*.  Then  by  III  the  occurrence  of  an  event 
in  an  incremental  strip  is  independent  of  oil  occurrences 
outside  the  strip.  Hence  the  projections  onto  the  coordinate 
axes  give  rise  to  independent  counts  along  the  axes. 


p  (x  Y«)  -  ( XY*x)nexp(-XY*x) 


n  *  0,1 , . . . 
0  <  x  <  X* 


and 

p  (X>  y)  -  »X"y>Vp(-XX»y) 
nv  '  nT 


n  ■  0,1 , . . . 
0  <  y  <  Y* 


which  gives  the  univariate  Poisson  distributions  with 
parameters  XY*  and  XX*  respectively. 

Note  here  the  inherent  dependence  on  the  shape  of  the 
assumed  regions.  In  using  rectangular  regions  equal  lengths 
in  the  marginals  reflect  equal  areas  in  the  bivariate 
distribution. 
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If  the  regions  were  circular  then  vertical  projections 
onto  the  axes  would  represent  decreasing  area  as  the  dis¬ 
tance  from  the  origin  increased.  Since  the  occurrence  of 
events  is  assumed  to  be  proportional  to  the  area  projected, 
an  actual  HPPP  would  induce  a  non-homogeneous  process  on 
the  marginals  due  to  the  distortion  in  the  mapping.  For 
clarification,  refer  to  Figure  2.  However,  if  the  regions 
are  circular  then  radial  projections  could  be  utilized  so 
that  the  event  occurring  at  (Xq ,yg )  in  Figure  2  is  repre¬ 
sented  in  the  x-marglnal  by  an  event  at  x?.  To  define 
equal  area  projections  in  this  case  the  transformation 

p 

x  x  «*  x'  Is  made,  in  which  case  a  unit  increase  in  x' 
defines  the  addition  of  a  unit  amount  of  area  to  the  region 
For  example,  if  a  unit  area  is  generated  by  a  circle  of 
radius  r  «  1,  then  the  area  enclosed  in  the  ring  of 
1  <  r  <_{2  is  the  unit  area,  as  is  the  area  in  the  ring 
<  r  </r,  etc.  In  general  ,  </n*  <  r  £  defines  in 
polar  coordinates  a  ring  with  unit  area. 

Returning  to  the  assumption  of  rectangular  regions, 
three  characteristics  of  the  HPPP  are  now  available  which 
can  be  used  as  the  basis  for  testing  a  sample  for  belonging 
to  the  HPPP  description  of  events  in  a  rectangular  region  R 

(A)  Independence  of  the  x-ordering  from  the  ordering 
on  the  y-components. 

(B)  Univariate  HPP  (homogeneous  Poisson  process)  in 
the  x-margir.al  and,  conditionally  on  n  events  in  R,  a 


Figure  2.  Vertical  and  radial  projections  of  an 
event  to  form  the  marginal  process.  Shaded  regie  .s 
represent  the  deviations  of  projected  areas  arising 
from  the  rectangular  projection  of  circular  areas. 
Thus,  the  shaded  regions  indicate  the  degree  of 
non-homogeneity  induced  by  the  mapping. 


uniform  distribution  of  the  distances  to  events. 

(C)  Univariate  1IPP  in  the  y-marginal  and,  conditional ly 
on  n  events  in  R,  a  uniform  distribution  of  the  distance: 
to  events. 

Property  (A)  can  be  tested  against  general  alternatives 
using  a  rank  correlation  procedure  (or  Spearman's  correla¬ 
tion,  see  Pearson  and  Hartley  [1966,  Table  M]).  Pro,’-,  riles 

(B)  and  (C)  can  be  tested  by  standard  univariate  methods 
as  in  Cox  and  Lewis  [1966]. 

Note  that  in  the  above  discussion  the  interest  lie:  in 
the  nature  of  the  process  rather  than  in  specifically 
describing  the  process.  Thus  the  determination  of  the 
parameter  \  of  the  Poisson  process  is  not  a  current  ob.'-e- 
tive  and  it  can  be  considered  to  be  a  "nuisance"  pa-a-.v. ■  r . 
Hence  the  conditioning  argument  above  and  the  result 
independence  of  the  tests  from  the  value  of  the  parameter 
are  Justifiable. 

Now  let  <*A  be  the  probability  of  a  Type  I  error  gener¬ 
ated  in  testing  for  randomness,  aB  be  the  corresponding 
probability  in  testing  for  HPPP  in  the  x-margiral ,  and  ac 
likewise  for  the  y-marginal.  Then  the  probability  of  not 
falsely  rejecting  the  HPPP  hypothesis  due  to  the  randomness 
test  is  1  -  a^,  etc.  Hence  the  combined  probability  of  not 
falsely  rejecting  HPPP  is  1  -  prob  {type  I  error}  or 

1  -  P(I)  -  (1  -  oA)(l  -  oB)(l  -  oc). 
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There  fore 


P(I)  ■  1  -  (1  -  aA)(i  -  aB)(l  -  oc) 


(13) 


Is  the  probability  of  fr.l'.tly  ct.'ng  a  HPPP  hypothesis. 

If  through  physic.'.  1  cor.vlo-T  ‘ions  one  of  the  tests  seem3 
more  or  less  olgnlflo.'.;il  i th  •  ovhers,  the  analyst  can 
choose  the  weightings  •  o  '  *  the  physical  properties. 
Otherwise  the  values  (and  ■ h  \c  •'sts)  can  be  weighted 
equally.  This  need  for  tr.e  '.ru  r.-.l nation  of  weightings  Is 
the  Inherent  dlsadvantr.ro  o'*  •  :  .It  1-level  test. 

The  Individual  test  s  j  r.e  r.tove  will  be  briefly 
described.  For  the  rank  ec  :*rcl:  *  res:  test,  consider  each 
x^  from  fx»y)(i)  which  ir  . : rcJ  on  the  y-conponent .  Also 
consider  the  ordered  rcc.*  'c;.~  dong  the  x  axis,  where 

X1  "  X(J)‘  Thpn 

6  I  (1  -  (J),)2 


1-1 


ra  "  1  " 


n(n  -  1) 


(lM 


where  (J)^  Is  the  position  of  x^  In  the  x-ordered  sequence, 
Is  the  rank  correlation  statistic. 

The  exact  distribution  for  r_  can  be  approximated  by 

s 

fitting  a  distribution  to  Its  moments  as  discussed  by 
Kendall  and  Stuart  [1951,  p.^77].  The  exact  distribution 
of  r  is  tabulated  In  Plcmctrlka  Tables  For  Statisticians 
[1966,  Table  p.?3)  for  observed  values  of  n  between 
and  10,  and  the  Introduction  to  these  tables  gives  approx¬ 
imations  for  10  <  n  <  20  and  for  n  >  20.  Fcr  10  <  n  <  20, 


r  can  be  treated  as  a  product-moment  correlation  coefficient 
s 

between  normally  distributed  random  variables.  For  n  >  20, 

r  */ n-1  is  assumed  to  be  unit  normal. 

In  testing  the  marginal  distribution  for  HPP,  two 

separate  tests  are  proposed.  First,  the  uniform  conditional 

test  is  used  to  test  against  trends  in  the  data.  As  stated 

In  Cox  and  Lewis  [1966,  p.  153],  "If  the  series  has  been 

observed  for  a  fixed  time  t  {length  X*}  and  n  events  occur 

in  (o,t  ){(o,X*)},  then  the  uniform  conditional  test  is 
o 

based  on  the  variables  =  T^/tQ  {  =  X^ ^/X*}  (i=l, . .  .n) 

conditionally  on  N,  being  equal  to  n."  The  {brackets}  are 

zo 

supplied  to  relate  the  material  in  Cox  and  Lewis  [1966]  to 

this  specific  problem,  and  N  *  n  means  the  number  of 

co 

occurrences  observed  is  n.  Note  that  in  the  conditioning  - 
of  the  realizations  the  "nuisance”  parameters  XX*  and  XY* 
are  eliminated. 

Secondly,  a  test  based  on  the  ordered  inter-event  spa  - 
ings  is  used  to  test  Poisson  against  stationary  event 
processes  which  may  be  non-Poisson.  For  this  test,  Durb:'  l’s 
modifications  of  the  uniform  conditional  test  is  used  [Cox 
&  Lewis,  1966,  p.  155].  Referring  to  Figure  3,  Durbin’s 
modification  describes  a  transformation  from  the  random 
variable  X  to  the  random  variable  T  and  then  to  the  random 
variable  S. 

Let  *  X*  -  X^^.  If  the  ^(i)»  ^  *  l,2,...,n, 

describe  the  "times  to  events"  in  a  Poisson  process,  then 
the  Tj,  i  ■  l,2,...,n+l,  are  independent  exponentially 


2*J 


T2  *  X(2)  "  X(l) 

Ti  C  X(i)  ”  X(l-1) 

) 

)  "  T(l) 

)  "  T(i-1) 

S1  S2  S3  S4 

Figure  3*  The  generation  of  the  transformed  variables 


S4  from  the  original  process  X,.x. 


distributed  random  variables  with  parameter  X.  If  the 
T^’s  are  then  ordered  and  the  S^’s  are  generated  as  shown, 
then  the  S^'s  are  Independent  exponential  random  variables, 
where  5^  has  the  expectation  l/( (n+2-1 )X) .  Also  the  trano- 
formation  ■  (n+2-l)S^  deflneo  Independent  Identically 
distributed  exponential  random  variables  with  parameter 

t  1  • 

X,  and  therefore  X,  ■  Z  S.  ,  1  ■  l,2,...,n  deflneo  the 

1  j-1  J 

times  to  events  In  a  Poisson  process  with  parameter  X, 

•  1  n  i 

and  U,  «  w  IS,  Is  the  statistic  upon  which  a  new 
i  *  J 

uniform  conditional  test  13  based. 

Both  tests  should  be  utilized  as  the  uniform  conditional 
test  Is  more  powerful  when  testing  for  trends  while  Durbin’s 
modification  is  relatively  more  powerful  In  testing  against 
stationary  event  process  alternatives.  However,  these 
tests  are  not  independent  of  each  other  and  thus  cannot 
be  combined  as  in  (13)  • 

As  an  alternative  to  the  above  procedure,  the  region 

of  concern  may  be  partitioned  into  several  sub-regions  and 

the  number  of  events  in  each  subregion  used  as  a  basis  for 
2 

X  testing.  This  method  is  discussed  by  Kendall  and  Stuart 
[1951,  pp.  57^-5]  who  mention  the  problem  of  choosing  the 
"right"  partition,  adding  "Whether  a  particular  partition 
has  statistical  interest  pends  on  the  purpose  of  the 
analysis".  Due  to  the  underlying  uniformity  of  the  condi¬ 
tional  distribution,  this  problem  reduces  to  the  selection 
of  the  number  of  regions  which  are  then  used  to  form  equal 
area  sub-regions. 
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Another  alternative  to  t hr  above  tea ting  procedure  is 
the  evaluation  of  the  sample  product -moment  correlation 
coefficient  under  the  bivariate  uniform  distribution.  The 
procedure  is  discussed  by  Kowalski  [197?],  tut  unfortunately 
his  discussion  does  not  address  the  bivariate  uniform 
distribution.  Kowalski  makes  two  points  very  strongly: 
"Firstly,  the  distribution  of  r  (the  sample  product -moment 
correlation  coefficient  under  nor.-normal  assumptions)  may 
differ  from  its  normal-theory  form  and,  secondly,  wc  may  be 
in  a  situation  in  which  p  is  a  poor  measure  of  association." 
Hence,  if  the  exact  distribution  for  r  under  the  bivariate 
uniform  distribution  were  known,  then  an  exact  test  for  the 
HPPF  (Given  the  occurrence  cf  r:  events)  could  be  devised. 

Durbin  [1970]  has  also  proposed  distance  methods  for 
testing  bivariate  distributions.  The  process  herein  described 
is  well-suited  to  the  methods  Durbin  uses  since  he  first 
transforms  the  observations  so  that  they  occur  uniformly 
on  the  unit  square.  Hence  the  natural  transformation 
x'  *  x/X*  and  y'  =  y/Y*  avoids  the  problem  of  possible  la  k 
of  uniqueness  which  is  the  central  objection  to  the  use  of 
distance  methods.  These  methods  allow  the  analyst  to  adopt 
Durbin's  bivai  ate  analog  of  the  Kolmogorov-Smirnov  tests. 

The  advantage  of  this  method  is  the  elimination  of  diffi¬ 
culties  concerning  multi-level  tests  and  partitioning 
tests. 
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The  tents  described  in  this  section  arc  very  general  In 
nature,  t.c. 

Hq:  the  process  Is  HPPP  Is  tented  against 

H^:  the  process  is  not  HPPP. 

Hence  the  alternatives  being  tested  against  are  multitudinous. 
If  it  is  desired  to  test  a  realization  ns  being  from  a  HPPP 
against  a  specific  form  of  departure  from  HPPP,  better  tests 
may  be  defined  based  on  the  nature  of  the  specific  alterna¬ 
tive.  For  instance,  one  such  departure  could  be  non-honoge- 
nelty,  i.e.,  where  A  is  not  considered  to  be  constant  but 
rather  a  function  of  location;  this  subject  Is  considered 
in  chapter  III.  Another  departure  might  be  in  the  nature 
of  the  process  itself.  For  example,  events  may  occur 
according  to  some  fixed  plan  in  which  case  the  process  is 
deterministic  and  thus  non-Poisson.  A  process  may  develop 
in  which  the  occurrence  of  an  event  prohibits  the  occurrence 
of  another  event  for  some  Interval  about  itself,  in  which 
case  events  are  not  Independent  of  other  events  and  are 
thus  non-Poisson. 

It  must  be  remembered,  however,  that  tests  against 
specific  alternatives  may  ignore  some  features  that  a  more 
general  test  would  detect  and  thus  each  individual  specific 
test  applies  only  to  the  specific  form  of  departure  being 
considered . 

Moreover,  in  all  reasonable  stationary  alternatives, 
it  does  not  seem  possible  to  derive  the  likelihood  function 


of  the  obsorvationo.  One  thus  cannot  derive  exact  testa. 

Por  tests  against  specific  alternatives  based  on  distance 
methods,  see  Holgate  [1972].  Teats  based  on  spectra  are 
discussed  by  Bartlett  [196*0. 

C.  SIMULATING  A  HPPP 

Suppose  one  were  concerned  with  searching  for  submarines 
which  ore  assumed  to  be  dispersed  in  such  a  manner  that  the 
locations  at  any  moment  are  generated  by  a  HPPP.  If  one 
search  procedure  is  to  be  selected  from  many  proposed  search 
procedures,  then  a  possible  manner  of  comparing  the  effec¬ 
tiveness  of  the  proposed  procedures  is  to  utilize  each  pro¬ 
cedure  against  several  simulated  u versions.  In  such  a 
simulation,  the  only  "variable”  which  would  be  of  interest 
would  be  the  procedures,  sc  all  variables  such  as  detection 
and  classification  parameters,  facilities  available,  etc., 
would  remain  constant.  Another  problem  which  might  be 
considered  would  be  the  effect  of  the  change  of  such  param¬ 
eters  on  the  search  procedure  selected  (i.e.,  a  sensitivity 
analysis  of  the  procedure  to  assumed  operating 
characteristics  and  facilities). 

By  the  initial  remarks  of  Section  B  above  and  the 
statement  of  equation  (12),  several  methods  of  artificially 
generating  realizations  of  a  HPPP  can  be  determined.  These 
methods  may  then  be  utilized  to  simulate  the  HPPP. 

Assume  that  the  parameter  XX*Y*  is  given.  To  select 
the  number  N  of  events  to  be  observed  in  the  region  with 
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area  X*Y*,  generate  a  random  number  U  distributed  uniformly 
on  [0,3 ].  Set  N  ■  n  if 


n-1 

£ 

1 


V  M  •  •  M  \  i 


^  ( >X*Y* )  exp (-XX* Y* )  .  ..  .  "  (AX»Y«)Aexp(-XX»Y«) 

i  r,  -  ii 

11  (15) 


V  «  \  i  - 


The  r»u;rjr.atIor.n  can  be  evaluated  using  either  x  or  Gamma 
Ir.teg  ’ul  Tables  [Cox  and  Lewis,  1966,  p.2l<): 


n-1  /  *i  -y 

E  M-S. 

i 


Tf 


prob  tx2n  >  2yJ 


00  vn-l  -v 

1  JTK=frrdv- 


N’oxt,  consider  a  random  variable  X  distributed  uniformly 

over  ( C , X* ) ,  denoted  X  'v  U(0,X*),  and  another  independent 

random  variable  Y  'v  U(0,Y*).  As  realizations  of  each 

random  variable  arc  generated,  number  them  chronologically, 

i.e.  in  order  of  appearance.  Generating  n  (as  determined 

above)  such  realizations  of  each  random  variable  yields  2 

numbers:  x, , . . . ,x„ ,y, , . . . ,y„ . 

l  n  1  n 

The  final  problem  remaining  is  to  select  a  scheme  for 
mating  the  x-  and  y-  realizations  to  form  ordered  pairs 
which  will  constitute  the  realization  of  the  HPPP.  A  few 
such  schemes  are  enumerated: 

1.  The  sequence  <(x^,y^)>^E^  forms  a  HPPP. 

2.  If  the  y^  are  ordered  to  form  <y(j)>j=i»  then  the 
sequence  <^xj>»Y(i))>  forms  a  HPPP. 
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3.  Similarly,  f°rm®  ®  HPPP. 

Additionally,  any  random  permutation  of  the  in  ?, 
the  y^  in  3  or  either  random  variable  3n  1  can  be  uoed 
to  form  a  HPPP.  Thus  <(xn+1_1  ,y^  )>  forms  a  HPPP,  etc. 
The  goal  of  the  simulation  and  the  purpose  of  simulating 
the  process  as  a  part  of  the  overall  analysis  mut-t  now  be 
considered.  If  during  the  simulation  it  is  desired  to 
generate  independent  realizations  of  the  process,  th»o  each 
iteration  must  involve  a  selection  of  n,  the  drawing  of  2n 
uniform  variates  and  the  mating  of  the  variates  through  some 
scheme  such  as  those  outlined  in  steps  1—h  above.  On  the 
other  hand,  if  it  is  desired  to  utilize  variance  reduction 
techniques,  then  for  any  drawing  of  2n  random  variates 
several  schemes  could  be  used  for  the  mating  process.  Here 
independence  is  lost  immediately  and  this  loss  must  be 
balanced  by  some  gain  elsewhere  in  the  analysis. 

D.  ESTIMATION  AND  TESTING  FOR  THE  PARAMETER  FROM  A  HOMOGENEOUS 
PLANAR  POISSON  PROCESS  (HPPP) 

If  the  hypothesis  that  the  process  is  HPPP  with  some 
unknown  value  of  the  parameter  X  is  accepted,  one  might 
like  to  obtain  a  point  estimate  or  confidence  interval 
estimate  for  X,  or  to  test  that  the  process  has  some  given 
parameter  Xq.  Note  that  the  parameter  X,  which  was  considered 
to  be  a  nuisance  parameter  in  the  previous  section  where 
the  structural  aspects  of  the  process  per  se  were  tested, 
now  specifies  the  process  completely. 
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Since,  as  wan  seen  In  Section  D,  It  is  possible  to 
set  up  the  Joint  probability  density  function  of  tho 
observations  in  a  HPPP,  point  estimation  of  X  can  be 
based  on  the  method  of  maximum  likelihood.  Note,  however, 
that  each  observation  consists  of  a  single  "look”  at  (or 
realization  of)  the  process  rather  than  n  observations  of 
a  single  random  variable.  Since  it  is  a  stochastic  process 
the  observations  arc  not  independent  and  identically  dis¬ 
tributed.  Hence  the  usual  Justifications  for  maximum 
likelihood  procedures  are  not  valid;  see  Brown  [1972]  for 
extensions  of  maximum  likelihood  theory  of  estimation  to 
realizations  of  a  Poisson  process. 

Using  the  results  of  Brown  [1972],  suppose  that  n  HPPP 
events  are  observed  to  occur  in  a  rectangular  region  of 
area  X*Y*.  From  (11),  for  n  £  0, 

L  *  f{(x,y)^j,...,(x,y)^nj,n;X}  ■  Xne“*X  Y*  (16) 

In  L  *  nlnX  -  \X*Y«,  (0  <  X  <  ».) 

If  n  M,  this  function  is  -®  at  X  ■  0  and  X  ■  •  and  since 
Si  A1?.  A  s  0.  _  X*Y*,  the  slope  of  the  function  decreases 
monotonically  from  «  to  -X*Y*.  Thus  In  L  has  a  unique 

j  i  ^  r 

maximum  at  the  point  where  ^ -  *  0.  Setting  this 

derivative  equal  to  zero  yields  a  unique  maximum  likelihood 
point  estimate  for  X  as 

X  *  y-; ,  (n  >_  1)  (17) 
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X)  and  has  vorlanco 


where  X  la  unbiased  (since  F.(X)  ■  ■ 

X/XiYi.  Note  that  as  the  oboorved  area  X#Y*  becomea  large, 
the  variance  of  the  estimate  becomes  email;  thus,  by 
Chebyahev'a  Inequality  [Lampertl,  1966,  p.  20] 

A 

p{  I X  -  X|  >  a)  <  v-ag--  ,  (a  >  0) 

ft 

and  aa  X*Y*  •  , 


P{ I X  -  X|  >  a)  *  0 

A 

for  all  positive  a  and  hence  X  converges  to  X  In  probability. 

A 

The  latter  statement  Is  equivalent  to  the  assertion  that  X 
Is  a  consistent  estimator  for  X.  Also  since  the  variance 
of  X  is  X/X*Y*,  X  has  an  estimated  variance  X/X*Y*  ■  n/(X*Y1)2, 
and  an  estimated  standard  error  of  /n?X#Yi. 

If  n  ■  0,  the  above  method  Is  not  applicable.  In  thi 
case,  it  might  be  preferable  to  give  a  confidence  interva 
estimate  for  X.  Specifically,  a  one-sided  test  alternative 
is  used  to  generate  a  test  for  the  assumed  value  Xru11  using 
as  an  acceptance  region  only  n  ■  0.  Intuitively,  Xnull  will 
be  small  enough  so  that  *nunX*Y*  <  1  (i.e.,  the  expected 
number  of  observed  events  is  less  than  1).  The  hypothesis 
to  be  tested  is  H^:  X  ■  Xnull  vs.  H^:  X  >  Xnul^.  Defining 
a  level  of  significance  a  from  (16)  by 

-X  X*Y* 

prob{N  *  0 1 X  «  Xnull>  *  1  -  a  -  e  nul1  , 
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the  hypothesis  MQ  la  accepted  at  the  level  a.  Conversely, 
for  any  given  valuo  of  a,  *nuu  rosy  be  determined  by 


-  x„unx*Y,  ■  ln  a  -  «> 


xnull 


where  the  X  ^  thus  determined  is  the  largest  value  of  X 
that  the  teat  will  accept  at  the  a  level,  given  that  n  ■  0. 

Returning  to  the  case  of  r.  >  1,  in  order  to  test  that 
the  parameter  of  the  process  has  some  given  value  XQ,  assume 
that  n  events  from  a  HPPP  are  observed  in  a  region  of  area 
X*Y*.  The  hypothesis  to  be  tested  is  HQ:  X  *  XQ  against 
the  two-sided  alternative  X  ^  XQ  although  one-sided 
alternatives  can  also  be  considered.  Since  N  is  a  random 
variable  taking  on  all  nonnegative  integer  values  with  some 
positive  probability  for  any  Xq,  there  is  always  some 
possibility  of  on  observed  value  of  the  random  variable  N 
(the  observation  being  denoted  n)  falling  outside  any  finite 
range  of  values.  Thus  a  region  (n”,n+)  must  be  specified 
such  that  if  N  lies  in  the  region  the  hypothesis  HQ  is 
accepted;  otherwise  the  hypothesis  is  rejected.  The  level 
a  of  the  test  is  the  probability,  given  X  *»  XQ,  that  N 
falls  outside  the  region  (n",n+). 

Since  the  test  has  been  defined  to  be  two-sided,  the 
level  is  split  into  upper  and  lower  levels  a+  and  a” 
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so  that  a  ■  a+  +  a".  The  procedure  must  consider  values  of 
X  <  X0  as  well  as  values  of  X  >  XQ.  To  proceed,  it  is 
necessary  to  define 


P+(n+;XQ)  -  P{N  >  n+|X  -  XQ}  ■  a+ 


1  "V*** 

•  (XnX«Y*)Je  0 

1  +  — - n - 

J-n 


and 


(18) 


P_(n”;X0)  *  P{N  <  n“|X  -  XQ)  *  a“  (19) 

n-  (XnX*Y.)Joxp(-X0X‘y,) 

B  T  ■■  ■  ■  ■  .  — 

j.0  J' 

Thus,  for  a  given  a  ,  an  n  may  be  determined  such  that  the 
statement  (18)  just  holds.  Also,  for  a  given  a”,  a  n"  may 
be  determined  such  that  (19)  Just  holds. 

The  null  hypothesis  is  accepted  at  the  a  level  if  the 
observed  value  of  N  falls  between  the  two  prescribed  limits 
(n+  >  n”),  where  prob{N  i  (n",n+)}  =  a.  Note  that  as 
stated,  the  result  is  indeterminate  since  a,  once  given, 
leads  to  many  values  for  a+  and  a"  «  a  -  a+  which  satisfy 
the  given  a.  The  manner  of  selecting  a+  and  cT  must  be 
stated.  Arbitrarily  it  may  be  desirable  to  set  ct+  =  a”  ■  a/2 
Asymptotically,  this  choice  of  a  symmetric  acceptance  region 
is  reasonable  since  as  n  increases,  the  distribution  of  N 


35 


io  approaching  the  (symmetric)  normal  distribution.  The 

+  •• 

choice  of  equal  a  and  a  may  not  be  reasonable,  however, 
for  small  XqX* Y#  since  the  Poisson  distribution  is 
positively  skev/ed. 

The  statement  prob(N  i  (n",n+)|X  ■  Xq)  *  a  is  the  result 
of  the  test  of  the  hypothesis  X  ■  Xq  at  a  given,  fixed 
level  a.  It  is  this  result  from  which  one  must  usually 
draw  conclusions  regarding  specification  of  the  process. 

If  the  information  thus  available,  i.e.  HQ  is  rejected 
or  accepted  at  the  pre-determined  a  level,  is  deemed 
insufficient  for  the  purposes  of  a  decision  maker  (for 
example)  ther.  another  possibility  is  that  the  post-analysis 
information  ray  be  extended  by  determining  for  each  obser¬ 
vation  the  exact  a,  ae,  at  which  the  hypothesis  would  have 
been  rejected.  The  decision  maker  is  then  left  with  the 
problem  of  the  determination  of  his  own  level  of  significance, 
possibly  based  on  his  intuitive  grasp  of  the  problem  and 
its  significance  in  a  larger  frame  of  reference.  Once  he 
has  determined  his  preferred  significance  level,  the  hyp<  ;h- 
esis  is  rejected  or  accepted  at  the  specified  level  by 
comparison  with  ae<  Thus  the  decision  maker  has  gained 
some  influence  over  the  analysis  but  has  had  to  pay  with 
some  time  to  reflect  on  the  problem  at  hand.  Alternatively, 
he  can  use  ag  Informally  as  a  "goodness  of  fit"  of  the 
hypothesis . 

Using  (18)  and  (19) ,  the  significance  test  is  defined 
conventionally  [Cox  and  Lewis,  1966,  p.  30]  to  be:  the 
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hypothesis  X  »  Xq  would  be  accepted  at  the  level  of  signi¬ 
ficance  a  In  a  two-oldcd  equi-talled  test  if  the  observed 
numbe*  of  events,  n,  la  such  hnt  n,  when  used  alternatively 
In  (18)  and  (19)  (l.e.,  la  assumed  to  be  one  or  the  other 
of  the  cnd-poir.tB  of  the  acceptance  region),  produr* ::  a 

v 

as  u  solution  to 

P(n; Xq)  -  ?mln (P^ (n ; Xq) ,P_(n ; Xq) )  ■  (20) 

Note  that  each  observed  value  of  n  generate:;  a  new  a  for 

O 

any  assumed  Xq,  hence  ae  ■  a^fr.,  XQ).  Por  example, 

P(3'1;20)  -  .0*36.  P(?0;?0)  •  .7C?8  and  r(l*,;20)  ■ 

It  can  bo  soon  thr-t  the  fixed  level  procedure 
computational ly  simpler,  since  fer  a  specified  c*  at  d  X  ,  the 
interval  (n”,r.+  )  need  only  be  computed  once  while  ir.  the 

latter  procedure  a  must  be  recomputed  following  each 

c 

observation  of  N. 

The  inverse  of  the  above  apj roach  which  utilized  the 
two-sided  equi-tnlled  test  of  significance  for  a  given 
value  Xq  leads  to  the  determination  of  confidence  interval 
estimates  of  X.  Given  that  n  events  are  observed,  it  is 
required  to  determine  some  limits  on  the  range  of  X  such  that 
the  true  parameter  value  XB  lies  within  the  stated  limits 
with  a  probability  1  -  a.  That  is,  it  is  required  to 
establish  a  X”(N)  and  a  X+(N)  such  that 

P { X” ( N )  <  XB  <  X+(N)|N  -  n)  •  1  -  a.  (21) 
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Using  P{N  <_  n  |  X  ■  X+)  <_  1  -  a+  to  define  a  X+  as  the 
greatest  X  such  that  equality  Just  holds  and  similarly 
using  P(N  <  n|X  ■  X-}  ■  a"  to  define  a  X"  establishes  the 
limits  such  that  (21)  holds.  For  a  proof  of  this,  see 
Brownlee  [1965,  r*  121].  Note  that  for  each  realization 
of  N,  a  now  ordered  pair  (X“,X+)  is  defined  so  that  the 
ordered  pair  is  a  function  of  a  random  variable  and  hence 
is  itself  a  random  interval.  The  procedure  only  states 
that  for  (1  -  a)  x  100?  of  the  observations  the  true 
parameter  X*  will  lie  within  the  limits  selected.  The 
limits  for  observed  n  from  0  to  50  arc  tabulated  [Pearson 
and  Hartley,  1966,  Table  ^C], 

For  a  normal  approximation  to  the  confidence  Interval, 
Cox  and  Lewis  [1966,  p.  31]  define  the  upper  a  point  of  the 
unit  normal  distribution  as  cq,  and  give  the  relationship 


prob{-clQ 

2 


N  -  XX*Y*  „  . 

(xxn*)17"  - 


> 


1  -  a, 


the  relationship  being  correct  as  XX*Y*  ®.  The  confidence 
limits  thus  obtained  arc,  to  a  second  degree  of  approximation 
using  a  continuity  correction  and  the  estimate  o(X)  =  yn?X#Y*, 


n  +  5°lo  i  '!.*-• 

?  ? 

For  example,  if  50  events  are  observed  from  a  HPPP,  the 
exact  .05  confidence  interval  is  37.11  <  XX*Y*  <  65.92 
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[Pearson  and  Hartley,  1966,  Table  HO]  whereas  the  normal 

approximation  gives  37*79  1  AX*Y*  <_  66.07. 

There  also  exist  x2  approximations  to  the  significance 
tests  and  confidence  intervals  [Cox  and  Lewis,  1966,  p.  33 
Brownlee,  1965,  P*  1731* 


Ill .  NON-HOMOGENEOUS  PLANAR  POISSON  PROCESSES  (NHPPP) 


A.  GENERAL  DISCUSSION 

If  the  stochastic  process  described  above  is  generalized 
to  allow  the  probabilistic  structure  of  the  event  process 
to  be  dependent  on  the  location  of  the  events,  a  non- 
homogeneous  planar  process  Is  evidenced.  In  the  simplest 
such  case  a  non-homogeneous  planar  Poisson  process  (NHPPP) 
arises  if,  in  the  definition  of  the  Poisson  process  given 
above,  assumption  I  is  modified  to  become 

I".  There  exists  a  positive  finite  function  A(x,y)  >  0. 
Also  note  that  II  is  changed  by  the  fact  that  the  number  of 
events  in  any  region  is  not  only  a  function  of  the  area 
of  the  region,  but  also  depends  on  the  location  of  that 
region  within  the  universe  under  consideration.  Thus  A 
is  now  expressed  as  A  *  A(x,y),  and  assumption  II  becomes 
II".  probtNCR^)  -  n} 

exp{-A(A1)} 

iTl 

where  A(A.  )  *  .  /  A(x,y)  dxdy  the  symbol  ./  Implying  the 
1  Hi  Ai 

integral  over  an  area  and  A(x,y)  is  assumed  to  be  continuous 

over  (with  area  A^)  so  that  the  integral  Is  valid. 

Assumption  III  remains  unmodified,  i.e.  events  occur 

independently  of  any  other  event  or  collection  of  events. 

Under  the  additional  assumption  that  A(x,y)  is  continuous 

within  the  region  of  consideration,  the  incremental 


development  of  Chapter  II  may  be  extended  to  achieve  a 
description  of  the  NHPPP.  Additionally  the  continuity 
assumption  on  X  and  the  definition  of  the  parameter  in  the 
process  as  an  integral  over  X  eliminates  the  difficulties 
of  line  discontinuities,  although  there  may  be  cases  where 
this  is  an  important  component  of  the  problem.  This  problem 
is  not  considered  here. 

Referring  back  to  Figure  1  in  Section  II-A,  consider 
specifically  the  incremental  strip  defining  region  R^.  If 
the  strip  is  divided  into  n  sub-regions  of  equal  area  by 
taking  n  equal  increments  along  the  x  direction  each  of 
length  6x,  then,  under  the  assumptions  on  the  behavior  or 
X(x,y),  the  process  In  the  ith  sub-region  can  be  approximated 
by  a  HPPP  with  parameter  X  =  X(x,y)i  where  (Xjy^  is  an 
arbitrary  point  in  the  ith  sub-region.  Specifically  (and 
arbitrarily)  the  lower  left  point  is  chosen  for  the 
succeeding  discussion;  thus  the  parameter  for  the  first 
sub-region  has  parameter  X  -  X(0,Y*).  Continuing,  the 
probability  statements  for  occurrence  of  events  become 

P^C^y)  =  X(0,Y*)<5xAy  +  o(6x,Ay)  0  <_  x  <_  5x, 

Pj^Xjy)  =  X(6x,Y*)6xAy  +  o(6x,Ay)  6x  <_  x  <_  26x, 


P^x.y)  =  X(j5x,Y#)6xAy  +  o(6x,Ay)  jfx  <_  x  <  (J+l)6x, 


where  j  =  0,1,..., n-1,  Y‘  <  y  <  Y*+Ay  and  n<$x  =  X*. 


Hi 


Since  the  probability  of  more  than  one  event  in 
is  o(X#Ay)  the  probability  statements  above  are  additive 
and 

n-1 

prob  {one  event  in  R,  }  =  E  X(j <5x,Y* )6xAy  +  o(X#Ay). 

1  J-0 

In  the  limit  as  n  +  »,  by  the  definition  of  an  integral 

X* 

prob  (one  event  in  R, }  8  {  /  X(x,Y# )dx}Ay  +  o(X#Ay). 

1  0 

By  similar  argument , 

Y* 

prob  (one  event  in  R~}  =  {  f  X(X*,y)dy)  Ax  +  o(Y#Ax) 

*  0 

and 

prob  (one  event  in  R^}  ~  X(X# ,Y* ) AxAy  +  o(AxAy). 

By  comparison  with  equations  (3),  (4)  and  (5)  the  above 
statements  lead  to  definitions  for  average  parameters  for 
each  of  the  regions  R^  R2  and  R^  as 


X^XSY*) 


X* 


f  X(x,Y*)dx, 

0 


1  Y* 

X2(X*,Y»)  s  fW  Qf  Mx*,y)dy, 


(22) 


and 


X3(x*,y»)  =  X(X#,Y*). 
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Using  those  average  parameters,  the  equations  (3),  (4) 
and  (5)  are  generalized,  resulting  in  the  following 
statements : 

prob  {one  event  in  R^}  n  X^X*  ,Y#)X#Ay  +  o(X#Ay), 
prob  {one  event  in  R2)  -  X2(X* )Y#Ax  +  o(Y*Ax),  (23) 

and 

prob  {one  event  in  R^}  =  X3(X#,Y*)AxAy  +  o(AxAy). 

Using  the  result  (22)  as  defining  the  parameter  in  each 
of  the  incremental  areas  in  Figure  1,  equations  (6),  (7) 
and  (8")  become 

P  (X*+Ax,Y#)  =  P  (X#,Y*)[l-roY*Ax]+P  ,  (X* , Y* )X0Y*Ax 

n  n  d  n-i  d 

+  o ( Y*Ax ) ,  (24) 

Pn(X\Y#+Ay)  =  Pn(X*,Y*)[l-riX«Ay]+Pn_1(X»,Y«)5T1X«Ay 
+  o(X*Ay) ,  (25) 

and 

Pn(X*+Ax,Y*+Ay)  =  Pn(X*,Y*+Ay)  +  Pn(X*+Ax,Y«)  -  Pn(X»,Y*) 

-  X3AxAy[Pn(X*,Y*)  -  Pn_1(X*,Y*]  (26) 

+  X1X2X»Y*AxAy[Pn(X*,Y*)-2Pn_1(X*,Y«) 

+  Pn_2(X*,Y*)] 

+  o(Y*Ax)  +  o(X#Ay)  +  o(AxAy). 
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Rearranging  terms,  dividing  by  AxAy  and  taking  the 
double  limit  as  fix  +  0  and  Ay  -*■  0  yields 

32P  (X»,Y*) 

TO* - X3CPn(XSy)  -  P^tX-.Yi)]  (27) 

+  X1X2X«Y*[Pn(X*,Y«)-2Pn<_1(X*,Y*)+Pn_2(X«»,Y*)] 

which,  together  with  the  boundary  condition  that  PR  is  a 
probability  statement,  gives 

P  /v*  v*n  B  (A(X«,Y«))n  exp{-A(X*,Y*)},  n  =  0,1,... 
nlA  ,Y  '  n!  (28) 

where 

X#  y* 

A(X* ,Y»)  =  /  /  X(u, v)  dudv.  (29) 

,  0  0 

Thus,  the  number  of  events  occurring  in  a  region 
bounded  by  the  coordinate  axes,  x  =  X*  and  y  =  Y*  hae  a 
Poisson  distribution  with  mean  given  by  A(X*,Y*).  The 
mean  can  be  considered  to  reflect  the  cumulative  effect  of 
X(x,y)  in  the  region  of  concern. 

If  n  events  from  a  NHPPP  are  observed  to  occur  in  a 
rectangular  region  defined  as  usual  with  area  X#Y*,  and  the 
events  occur  at  (x,y)^,  i  *  l,...,n,  the  labelling  done 
on  the  magnitude  of  the  y-component,  then  the  joint  density 
of  the  events  and  the  probability  that  the  number  of  events 
in  X*Y*  is  n  is  given  by 


(30) 


n  X((x,y)m)exp{-A(X«,Y*)}. 

J-l 

Note  that  (30)  is  a  direct  generalization  of  (16). 

Hence  the  NHPPP  can  be  described  in  a  fashion  similar 
to  the  HPPP,  but  the  expressions  have  acquired  increased 
complexity  due  to  the  necessity  for  the  inclusion  of 
integrals  to  define  the  parameters.  The  degree  of  added 
complexity  is  dependent  upon  the  enoice  of  the  specific 
functional  form  for  X(x,y).  The  next  section  develops  the 
expressions  for  one  specific  form. 

B.  A  SPECIAL  CASE 

To  consider  the  location  dependent  type  of  process, 
a  particular  form  for  X(x,y)  is  chosen  as 


X(x,y)  =  exp  {«+  6x  +  yy  +  <5xy}.  (31) 

Note  that  if  Bx  +  yy  +  6xy  changes  very  little  over  the 
range  of  interest  of  x  and  y,  then 

X(x,y)  =  (1  +  Bx  +  yy  +  6xy)exp{a).  (32) 


Other  relationships  may  be  used;  however,  they  may  cause 
necessary  and  untidy  restrictions  on  the  values  which  the 
constants  a,  3,  y  and  6  may  assume.  In  particular,  X(x,y) 
must  be  greater  than  0  and  the  bivariate  exponential 
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polynomial  (31)  ensures  this  with  no  restrictions  on  the 
range  of  the  parameters. 

Additionally,  algebraic  manipulation  of  the  form  reveals 
that  the  curves  of  X(x,y)  *  c,  c  a  constant,  include  some 
interesting  properties. 

1.  If  6  =  0,  then  In  X(x,y)  =  c  is  a  family  of  straight 
lines  in  the  plane,  intersecting  the  x-axis  at  an  angle 
0  *=  tan"’^(-0/Y) .  In  this  case  a  clock-wise  rotation  of 
the  coordinate  axes  through  an  angle  0  would  give  an 
exponential  function  of  y  only. 

2.  If  6  /  0,  then  In  X(x,y)  =  c  describes  a  system  of 
contour  lines  which  form  a  hyperbolic  paraboloid  with  a 
saddlepoint  at  (-y/6,-3/6)  as  is  illustrated  in  Figure 
It  may  be  helpful  to  interpret  the  Figure  in  terns  of  a 
section  of  forest  which  has  been  sampled.  The  line  r 
describes  a  possible  direction  of  steepest  ascent  (DSA) 
which  passes  through  or  near  the  region  being  sampled. 

This  DSA  may  not  be  a  topographic  feature  but  rather  a 
mathematical  expression  for  a  possible  increase  in  density 
of  trees  along  some  line.  Obviously,  there  may  exist  a 
strong  correlation  between  this  mathematical  DSA  and  some 
topographic  features.  Note  here  that  along  the  DSA  maximal 
values  for  X(x,y)  are  found  in  the  sense  that  departing  the 
DSA  at  right  angles  leads  to  decreased  values  for  X(x,y), 
i.e.  decreases  in  the  forest  density. 


X  decreasing/ 


X  increasing 


P  \(x  v)  =  exp(a+f5x+Yy+6xy } 

Figure  4.  Contour  lines  for  XCx,yJ 

Note  asymptotes  and  region  being  described 

(hatched.)  Here  f$/Y  -  2  ,  6/6  =  4  and 

all  coefficients  are  positive. 


3.  The  exponential  form  can  be  extended  with  little  con¬ 
ceptual  difficulty,  but  possibly  greatly  increased  mathe¬ 
matical  difficulty,  to  describe  a  much  wider  range  of 
possible  circumstances.  For  instance,  it  is  reasonable 

to  assume  that  the  DSA  line  will  bend;  hence  terms  such  as 
2  2 

ex  y  and  £xy  and  higher  order  may  be  included  in  the 
exponent . 

For  the  special  form  of  (31),  the  cumulative  or  integrated 
intensity  function  A(X*,Y#)  is  given  by  equation  (29)  and 
becomes 


A'(X#,Y») 


A(X«,Y»).<5 
exp Tct-lTyTS)' 


Ki{(Y+<5X*)(f+Y*)  }-Ei{(Y+SX*)f} 

(3d) 


ET{( $+$Y*)-^}  +  Ei , 


where  ET  (•)  is  the  exponential  integral  and  ET(  • )  =  c  +  In 

®  /  n! 

+  l  ,  where  c  »  .577216  is  a  constant,  as  defined  in 

i=l1 '  x 

Jahnke  and  Emde  [1945,  p.  2], 

The  likelihood  function  for  the  NHPPP  may  be  developed 
in  a  manner  similar  to  that  used  in  the  discussion  of  the 
HPPP.  The  discussion  leading  up  to  (16)  is  modified  by  the 
fact  that  the  parameter  is  location  dependent  resulting  in 


(•) 


L  =  exp{-A(X«,Y*)}  n  X((x,y)m),  (n  >  1)  (3*0 

i=l  '  ' 

>here  (x,y)^  is  a  labelling  of  the  coordinates  of  the  n 
point  events.  Thus, 
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n 

In  L  -  -A(X«Y»)  +  Z  In  X((x,y)m) 

i=l  u; 

follows  directly.  For  the  special  case  of  X(x,y)  given  by 
(31),  equation  (3*0  becomes 

n  n  n 

In  L  =  -A(X*Y*)+B  E  x.+y  E  y,+6  E  x,y.+nct  (35) 

1=1  1  1=1  1  1=1  1  1 

where  A(X#Y* )  Is  given  by  (33). 

The  above  joint  density,  or  likelihood,  function  pro¬ 
vides  a  functional  form  which  may  be  manipulated  to  accom¬ 
plish  the  two  principal  concerns  of  the  analysis  cp  point 
processes:  hypothesis  testing  and  parameter  estimation.  The 
obvious  null  hypothesis  is  HQ :  $  =  y  =  6  =  0,  in  which  case 
the  nonhomogeneous  Poisson  process  is  being  tested  for  homo¬ 
geneity  since  a  non-zero  a  yields  a  constant  parameter 
X  =  exp (a)  >  0.  Should  the  above  null  hypothesis  be  rejected, 
then  the  analysis  proceeds  to  develop  estimates  for  the 
parameters  8,  y  and  6.  This  phase  of  the  analysis  may 
proceed  differently  depending  on  how  many  and  which  of  the 
parameters  were  tested  as  being  different  from  zero.  The 
complete,  and  most  complicated,  situation  develops  when  all 
parameters  are  determined  to  be  non-zero.  Testing  of 
parameters  is  the  topic  of  Chapter  IV  while  Chapter  V 
discusses  the  estimation  of  parameters  determined  to  be 
non-zero  as  a  result  of  the  testing  procedure. 
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IV.  TESTING  FOR  NON- ZERO  PARAMETERS 


A.  PRELIMINARIES 

It  is  desired  to  formulate  a  method  for  testing  the 
data  (i.e.,  the  number  of  events  and  their  locations)  in 
order  to  determine  which  of  the  parameters  in  the  model 
given  by  (35),  specifically  a,  S,  y  and  6  ,  are  non-zero. 

Note  that  three  assumptions  are  inherent  at  the  outset: 
first,  that  the  NHPPP  model  is  valid;  second,  that  the 
testing  for  homogeneity  in  Section  II-B  led  to  the  rejec¬ 
tion  of  the  hypothesis  of  homogeneity;  and  third,  that  the 
physical  phenomena  can  be  modelled  by  the  NHPPP  given  by 
(34)  with  the  parameter  A(x,y)  given  by  (31). 

Testing  the  Poisson  hypothesis  per  se  when  the  function 
A(x,y)  is  not  known  is  a  compound  problem  which  will  not  be 
considered  here.  It  is  analagous  to  the  compound  probler  in 
regression  analysis  of  testing  both  for  an  unknown  regre:  .ion 
function  and  for  independent  equal  variance  errors. 

From  the  third  assumption,  the  likelihood  function  for 
the  data  is  given  by 

L  ®  exp{-^\.(X#,Y*) Hn  expCBEXj^  +  yEy^^  +  fiEx^}  ,  (36) 

where,  for  clarity  in  the  future  development,  £  =  exp{a}. 
Conditioning  on  the  occurrence  of  n  events,  n  >  1  and 
defining  L{(x,y)^  ,. . .  »(x,y)(n)  |n;A(x,y)}  *  L(n)  leads  to 
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,  nf  oxpCBEx.  ♦  Yly«  ♦  Blx,y,} 

«">  *  probllT.T)'  *  V  X  ' - 1 - - - - 

(  /  /  OXp(BU  ♦  yV  4  «UV)  dudv) 

0  0 

(37) 

where  LOO  lo  read  "the  likelihood  function  conditioned  on 
the  occurrence  of  n  events."  Note  that  conditioning  on 
the  number  of  events  observed  has  resulted  in  an  expression 
which  is  independent  of  the  parameter  C(or  <*),  l.e.  for 
given  6,  y,  and  6,  n  is  a  sufficient  statistic  for  a.  This 
is  convenient  because  a  here  is  a  "nuisance"  parameter  since 
the  terms  of  interest  are  those  which  would  indicate  non- 
homogeneity  rather  than  the  establishment  of  the  overall 
rate  of  occurrence.  Thus  by  using  the  conditional  likeli¬ 
hood  a  may  be  eliminated  and  the  testing  can  proceed  for 
non-rero  B,  y  and  B.  In  other  words  the  value  of  a  should 
not  influence  the  testing  for  non-homogeneity  parameters. 

If  p  •*  0,  certainly  no  departure  from  homogeneity  could 
be  evidenced  and  hence  this  case  is  covered  by  l!PFF;  see 
II-B  above.  Hence  the  case  of  interest  is  n  >  1, 

Physically,  the  model  (35)  gives  rise  to  a  parameter 
surface  X(x,y)  which  has  the  properties: 

(a)  6^0;  Y^O;  6p*0:  lnX  forms  a  hyperbolic 

paraboloid  superimposed  on 
a  tilted  plane,  l.e.  some 
"warping"  of  the  tilted 
plane  is  evidenced. 

(b)  B  ^  0;  y^Oj  6-0:  lnX  forms  a  plane,  tilted 

with  respect  to  the  x-y  plane. 

(c)  6  ■  Y  *  0»  6p*0:  lnX  forms  a  hyperbolic 

paraboloid. 
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(d)  0  ■  Y  ■  6  ■  0:  In  X  forms  a  plane  parallel 

to  the  x-y  plane,  l.e.  a 
HPPP  Is  evidenced. 

There  are  a  number  of  possibilities  for  testing: 

(a)  A  test  of 

H0:  g  B  y  =  6  *=  0 
against 

at  least  one  of  the  parameters  0,  y,  6  /  0 

is  a  test  for  non-homogeneity  which  is  more  specific  than 
those  in  Section  II-B  and  is  easily  derived  by  likelihood 
ratio  techniques. 

(b)  The  above  test  is  not  of  great  interest;  generally  the 
specilic  non-zero  parameter  is  desired  rather  than  Just  that 
at  least  one  of  the  three  is  non-zero.  This  leads  to  the 
question  of  selecting  the  significant  subset,  a  problem 
which  is  difficult  and  as  yet  is  unresolved. 

(c)  The  simpler  problem  is  to  assume  an  ordering,  i.e.  that 
if  0  e  y  “0,  the  process  is  homogeneous  (5  is  then  assumed 
to  be  0)  and  if  0  or  y  is  non-zero  but  6  =  0,  then  higher 
order  terms  are  assumed  to  be  zero.  However,  if  the  test 
indicates  non-zero  8  or  y  this  may  be  due  to  an  aliasing 
effect  because  of  a  non-zero  6.  If  further  testing  of 

6  =  0  against  6/0  reveals  6/0,  then  it  may  well  be  that 
the  true  situation  is  0  =  y  ■  0  but  6/0.  The  procedure 
to  be  followed  will  not  discriminate  this  case. 
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The  same  aliasing  effect  occurs  in  testing  of  6  =  0 
against  6^0  where  $  and  y  are  non-zero  and  it  is  desirab1 
to  perform  this  test  without  the  effects  of  the  non-zer> 

0  and  y.  These  are  thus  nuisance  parameters,  as  was  the 
case  with  a  in  testing  0  and  y.  For  the  present  model  (35), 
one  can  eliminate  these  parameters  because  it  is  seen  from 
the  exponential  form  (36)  that  for  any  6,  (n,  Zy^) 

is  a  set  of  sufficient  statistics  for  (a,  0,  y).  Thus 
6  “  0  is  tested  with  some  function  of  Zx^.^  given  n,  Zx.^ 
and  Zy^  This  statistic  has  a  distribution  independent 
of  the  parameters  a,  0,  y. 

The  reason  for  basing  the  conditional  test  on  Zx^y.^ 
is  that  this  is  (conditionally)  a  sufficient  statistic 
for  <5. 

B.  SPECIFIC  TESTS 

Assuming  that  some  ordering  exists  on  the  parameters  3 
discussed  in  possibility  (c)  above,  tests  are  performed 
using  the  sufficient  statistics  (n,  Zx^  Zy^  Zx^y^  to 
determine  if  any  non-homogeneity  is  evidenced  by  the  data 
(i.e.,  through  the  statistics).  This  testing  is  more 
specific  in  nature  than  the  testing  encountered  in  Section 
II-B  above  due  to  the  selection  of  a  particular  model. 

The  set  of  sufficient  statistics  arises  from  this  choice  of 
a  specific  model  to  use  as  an  alternative  to  homogeneity. 
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The  testing  will  assume  the  following  sequence: 

(i)  Condition  on  n  and  set  6  =  0.  Test  ft  ■  y  *  0 

against  ft  /  0  or  y  ?  0.  Note  that  it  would  not  be 

informative  to  test  either  ft  or  y  as  a  separate  entity  since 
in  the  formulation  of  the  model  ft  and  y  are  unique  only  up 
to  an  angle  of  rotation.  That  is,  testing  of  ft  and  y 
Jointly  amounts  to  the  detection  of  any  tilt  in  In  X(x,y) 
with  respect  to  the  x-y  plane,  regardless  of  the  direction 

of  the  tilt.  Failure  to  reject  leads  to  the  assumption 

of  homogeneity  due  to  the  assumed  ordering. 

(ii)  Rejection  of  H0(i)  leads  to  testing  of 

H0(ii):  6  =  0,  -«  <  ft  <  00  and  -<«  <  y  <  « 
against 

Hl(li):  5  ^  0;  -»  <  ft  <  “  and  -°°  <  y  <  m. 

The  test  thus  specifies  y  and  ft  as  nuisance  parameters. 

In  this  test  it  is  necessary  to  first  condition  on  n,  Ex^ 
and  Eyi  to  eliminate  the  nuisance  parameters. 

In  (i),  conditioning  on  n  and  setting  6=0  leads  to 

(fty)n  n!  exp {ft Ex.  +  yEy . } 

L(n)  -  - K 1 - r  . 

(exp{ftX)  -l)n  (exp(yY)  -  l)n 

From  this  it  is  seen  that  the  statistics  (Ex^,Ey^)  are 
(conditionally)  Jointly  sufficient  statistics  for  ft  and  y. 
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Under  H0^j, 

Ex1/n  -v  N(X*/2,X*2/12n)  and  Ey^n  -  N(Y */2  ,Y«2/12n) 

and  the  statistics  are  independent  (see  Section  II-B).  Hence 
the  expression 

Ex./n  -  X*/2  2  Ey,/n  -  Y*/2  2 

— )  +  (_i__ — ) 

X»/  /l2n  Y#/  /l2n 

p 

is  asymptotically  X2*  Rejection  or  acceptance  of  R0(i) 
is  based  on  the  adherence  of  the  calculated  value  of  this 
sum  to  the  x  distribution  ,  i.e.  Hq  is  accepted  if  this 
sum  has  sufficiently  small  values.  Acceptance  of  Hq^, 
as  stated  earlier,  leads  to  assumption  of  HPPP;  refer  to 
Chapter  II. 

Following  the  rejection  of  H^^  it  is  necessary  to 
proceed  with  testing  of  Hq^j.  As  can  be  seen  from  an 
examination  of  (37),  the  complexity  of  the  exact  distribu¬ 
tion  following  another  conditioning  argument  (i.e.  on 
n,  Exi  and  Eyi)  is  prohibitive.  However,  for  large  sample 
sizes  the  conditional  distribution  can  be  approximated  from 
the  fact  that  Ex^n,  Ey^n  and  Ex^/n,  conditioned  on  n, 
are  Jointly  normally  distributed  for  large  n.  Thus  the 
asymptotic  distribution  of  Ex^y^/n,  given  n,  Ex^/n  and 

can  b®  found  from  normal  theory  multiple  regression 
results. 
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Under  the  assumption  that  (5  =  y  c  6  *  0,  the  trivariate 
normal  distribution  which  arises  is  characterized  by  a  vector 
and  a  matrix.  The  vector  (^)  of  expected  values  and  the 
variance-covariance  matrix  (E_)  are  given  by 


and 


rX2/12n 


kX*Y/2iln 


0 

Y2/12n 

XY2/2^n 


X2Y/24n 

XY2/2^n 

7X2Y2/lMn 


from  which  p12  =  0  and  p.^  “ 
In  the  model  given  above 


p23 


0.65465. 


H0(il);  4  “  °S  -“  <  B  <  -;  -  <  Y  <  » 

is  to  be  tested  against 

Hl(ii>:  4  *  0i  -  <  B  <  -i  -  <  Y  <  »• 

Since  Ex^  is  a  sufficient  statistic  for  6  when  n,  Ex^ 
and  Ey^  are  given,  the  test  can  be  based  on  Ex^y^.  Its 
asymptotic  (conditional)  normal  distribution  has  mean 

I 

y  and  standard  deviation  o’  given  by 
xy 
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-  ISJL  p  (Ix^n  -  X/2)  + 


'xy  '  pxy  ‘  °x  K!3 


+  ^  P23  (XV i/n  -  */2) 

y 


i 

.  2  2  ,7 

°xy  ’  °xy  (1  ‘  P13  '  P23  ‘ 


„  r*lyl/n  -  Pxy  ls  distributed  as  a  unit 

Thus  under  Hq^)*  o£“ 

-  „  yls  accepted  if  this  statistic  has 

normal  variate  and  is  a  p 

sufficiently  snail  values.  Failure  to  reject  H0(il)  «uld 
imply  that  the  In  X(x.y)  plane  Is  tilted  with  respect  to 
the  x-y  plane,  but  no  "warping”  Is  evidenced. 

The  above  development  relies  heavily  on  asymptotic 
assumptions.  Small  sample  problems  will  be  much  more 
difficult  to  analyze.  Any  point  In  the  above  procedure  which 
lead  to  rejection  of  any  hypothesis  would  require  the  analysis 
to  proceed  with  the  estimation  of  the  non-zero  parameters. 
This  ls  the  subject  of  the  next  chapter. 
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V. 


estimation  of  parameters 


It  is  desired  to  formulate  a  method  for  estimating  the 
parameters  a,  8,  Y  and  6  of  the  non-homogeneous  planar 
Poisson  model  given  In  IV-A  where  It  has  been  established 
that  a  non-homogeneous  process  is  evidenced  by  the  data. 
Taking  the  logarithm  of  the  conditional  likelihood 

function  (37)  results  in 

in  L(n)  -  in  n>  ♦  BS*t  *  Y^  -  «Vi  +  n  **  <38> 

where  A  -  A(X*  Point  estimation  of  (a,  6,  Y,  «> 

by  the  method  of  maximum  likelihood  uses  the  conditional 

.  / -j p\  dpveloD  the  estimates >  See 

likelihood  function  (38)  to  develop 

Section  II-D  for  comments  regarding  use  of  maximum  likelihood 
in  this  application.  The  solution  to  the  set  of  simultaneous 

equations 

_  n  ;Y*  /  u  exp(Bu  +  yv  +  6uv)  dudv  -  0 
i  A  o  0 


iy 


.  n  /*  /’v  explBu  +  yv  +  «uv)  dudv  -  0  (39) 


i  A  0  o 

*Vi  -  r  /  uv  e 


Y*  X*  xpUu  +  yv  +  6uv>  dudv  -  0  t) 


A  ^ 


0  o 

if  obtainable,  provides  the  point  estimates  8.  y  and  6. 
Note  that  this  approach  neglects  the  homogeneous  term 


during  the  estimation  of  the  parameters  giving  rise  to 
non-homogeneity.  The  neglected  parameter  may  be  estimated 
last. 

AAA 

In  order  for  the  solution  (8,  y,  $)  to  equations  (39) 
to  describe  a  relative  maximum  to  In  L|n,  it  is  necessary 
and  sufficient  that  the  matrix  of  second  partial  derivatives 
(£)  be  negative  definite,  see  Frisch  [1966,  p.  120].  In 
examining  this  matrix  in  the  case  of  (38),  it  is  helpful 
to  define  S(u,v)  *  exp  (Bu  +  yv  +  duv).  Then  the  function 


s(u,v) 


S(u.v) 

A 


has  the  properties: 

(a)  s(u,v)  >  0 

Y*  X* 

(b)  /  /  s(u,v)  dudv  »  1 

0  0 

(c)  s(u,v)  is  continuous  on  [0  <  u  <  X*,  0  v  <_  Y*]. 


Hence  s(u,v)  is  a  probability  density  function  [Gnedenko, 
1962,  p.  171]. 

Hence  the  matrix  £  can  be  shown  to  have  diagonal 
elements  such  as 


o 


11 


Y*  X«  2 

n [  /  /  u*s(u,v)  dudv  -  ( 

0  0 


y»  x* 

/  /  us(u,v)  dudv)  ] 

0  0 


■  -  n  Var  U. 
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Continuing,  the  result  i 
W  =  UV) 

(-n  Var  U 
-n  Cov  (U,V) 

-n  Cov  (U,W) 

and  £  is  revealed  to  be 
condition  for  a  relative  maximum,  i.e.  Z  negative  definite, 
is  independent  of  the  realizations. 

~  -3 

Now  £  »  -nJ£  where  £  is  the  usual  variance-covariance 

matrix  for  a  tri-variate  distribution.  But  Z  is  positive 

semi-definite  [Gnedenko,  1962,  p.  212],  hence  -£  is  negative 

semi-definite.  That  each  of  the  principal  minors  has 

non-zero  determinants  remains  to  be  shown. 

By  the  expressions  giver,  in  Gnedenko  [1966  ,  p,  212], 

the  covariance  matrix  2  can  be  seen  to  be  a  Hankel  matrix 

[Gantmacher,  1_,  1959,  p.  338].  Hence  if  the  rows  of  Z  are 

linearly  independent,  then  the  determinant  of  £  >  0.  But 

also  Var  U  >  0  since  U  is  a  random  variable  and  Var  U  Var  V  - 
2 

Cov  (U,V)  >  0  since  the  case  of  line  discontinuities  has 
been  excluded  (i.e.,  U  cannot  be  a  linear  function  of  V). 

By  the  same  reasoning,  W  is  linearly  independent  of  U  and 
V,  Hence  all  principal  minors  are  greater  than  zero,  hence 
Z  is  positive  definite,  hence  I  is  negative  definite. 

AAA 

Thus  (0,  y,  <$)  provides  at  least  a  relative  maximum  to 
In  L|n. 


s  (where  W  is  defined  to  be  the 


-n  Cov  (U,V)  -n  Cov  (U,W)' 
-n  Var  V  -n  Cov  (V,W) 
-n  Cov  (V,W)  -n  Var  W 


a  covariance  matrix.  Note  that  the 
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If  it  were  possible  to  determine  that  (3,  y,  6)  provides 
a  global  maximum  to  In  L|n  in  the  region  of  interest,  then 
conclusions  as  to  uniqueness  of  the  estimator  could  be 
drawn.  Unfortunately,  global  extrema  are  difficult  to 
establish.  Since  the  method  of  estimation  used  was  maximum 
likelihood,  the  estimates  are  consistent.  Questions  of 
biasedness  are  unresolved. 

In  order  to  solve  the  system  of  equations  (39),  it  is 
necessary  to  determine  initial  values  for  the  parameters  as 
a  starting  point  for  an  iterative  procedure.  The  partial 
differentiation  of  In L  (35)  with  respect  to  the  parameters 
and  setting  these  partials  equal  to  zero  results,  after 
some  algebraic  manipulation,  in 


n  -A(X,Y) 


Ex 


rc  YY(e(B+«Y)X  _x) 
i  +  t  "  T  L  3+Sy 

t,  .  6  a  e“  re«(e'^4»Y  -1) 

Eyi  +  «A'  T  c - yOT - 


eBX  -  1 


]  -  0 


eyY-l 


•]  *  0  («0) 


-  «XY[eBX+YY+6XY  -  1]  -  3X[eYY  -  1]  -  yY[e6X  -  1])  -0 


If  it  is  assumed  that  the  sum  $X  +  yY  +  $XY  is  small 
(near  zero)  as  well  as  the  individual  terms  in  the  summation 
being  small,  then  the  exponentials  can  be  approximated  by 
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exp(x)  S  l  +  x,  *  near  zero.  Using  the  first  equation 

in  system  (40)  to  give  the  Value  for  A(X«,V),  i.e. 

A(x,>v.)  .  n>  ana  the  linear  approximation  in  the  remaining 

terms  gives  the  abbreviated  system: 


Zx 


i 


H  0 


zy 


■  o 


(4i) 


-62XY  +  BYXY]  -  0 


The  solution  to  (41)  provides  the  initial  estimates  for  the 
parameters.  These  estimates  ean  then  be  used  in  (39)  or 
(bO)  to  search  for  sequentially  closer  and 
approximations  in  a  mathematical  programming  approach. 

Following  the  determination  of  the  estimates  6,  V 
and  S.  I  can  be  determined  from  the  solution  to  the  firs 

equation  in  the  set  (40). 

The  determination  of  confidence  Intervals  and  levels 
of  slcnlflcance  is  not  considered. 


VI.  CONCLUSIONS 


The  procedures  in  Chapters  III-B,  IV  and  V  are  dependent 
on  the  particular  choice  of  parameter  form;  however,  with 
different  forms  the  concept  of  a  non-homogeneous  planar 
Poisson  process  may  be  used  to  describe  a  wide  variety  of 
"randomly"  occurring  phenomena.  The  choice  of  parameters 
which  may  be  used  is  limited  only  by  assumption  I,  l.e. 
positivity.  One  advantage  of  the  method  discussed  herein 
over  previously  proposed  schemes  is  the  fact  that  the  1 
specific  form  used  admits  the  possibility  of  a  ridge  or 
line  of  maximum  density  to  be  mathematically  specified 
and  estimated. 

Also  there  is  an  attempt  to  describe  the  underlying 
process  that  caused  the  points  to  appear  where  they  did, 
as  opposed  to  using,  for  instance,  the  arc  within  which 
the  most  events  were  observed  as  the  point  estimate  for 
the  direction  of  maximum  increase. 

Further  efforts  in  this  area  Include  a  generalization 
Into  four  dimensions  (x,y,z,t)  in  order  that  zoological 
as  well  as  botanical  densities  may  be  studied.  Of  especial 
interest  is  the  estimation  of  densities  of  aquatic  life 
and  how  the  observed  density  fluctuates  with  season  and 
with  changes  in  environment.  The  latter  problem  seems  of 
prime  importance  in  evaluating  the  effects  of  anti-pollution 
programs  on  the  fluid  systems  in  which  plants  and  animals 
exist . 


63 


Another  problem  which  Is  closely  related  to  the  above 
Is  that  of  Imperfect  sampling  and  how  the  estimates  are 

biased  by  sampling  techniques. 

Chapters  III,  IV  and  V  may  be  redefined  in  terms  of 
data  gathered  within  a  circle  about  some  fixed  point, 
especially  with  consideration  of  the  relative  efficiency 
of  this  data  form  referred  to  by  Matern  [I960], 


Olvcn  .  region  R  In  E?  of  are*  A  and  the  fact  that  the 
jrobablllty  of  occurrence  of  an  event  In  any  aub-regton  R, 
3f  area  A,  within  H  la  alr.ply  A,/*,  a  bivariate  uniform 
distribution  la  doacrlbed.  For  def Inltene.a  assume  the 
region  R  la  rectangular,  oo  A  -  X»Y*.  Now 

„  JSLl  Prob  (x  -  x)  Pr°b  (Y  1  y) 

for  0  <  x  <  X*.  0  <  y  i  V,  in  which  caae  It  la  apparent 
that  the  coordinate  axes  define  Independently  chosen 
univariate  random  variables. 


the  density  function 


M  _ 1  J  tl 


f(x.y)  - 


o  <  x  <  x* ,  o  <  y  <  **• 


From  the  density  function  the  Joint 
bivariate  uniform  random  variables 


density  for  n  indepen  ent 
is 


rCU.y^ 


,(x,y)n,n) 


l/(X*Y*)n 


where  (x.y).  denotea  the  1th  pair  of  random  variable, 
selected.  Now  n  pairs  of  random  variables,  or  more  .Imply 
„  points  in  the  plane,  can  only  be  ordered  (without 
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replacement)  In  n!  ways,  Independent  of  the  ordering  process 
chosen.  Hence,  the  Joint  density  function  for  n  ordered 
bivariate  uniform  random  variables  is 

f((x»y)(1j»»«»»(x»y)(n)»n)  B  n!/(X*Y#)n 

i,  u 

where  (x>y)(i)  is  the  *  point  selected  in  the  ordering 
scheme  utilized. 

As  a  specific  example,  consider  the  n  points  to  be 
labelled  with  respect  to  increasing  magnitude  of  the  y~ 
component.  Then 


yk  °  y(k)  k  c  !»•••» n  and  Cx,y)(k)  =  ^xk>y(k)^ 


If  the  x-components  are  also  ordered,  then  the  set  of 

p 

points  P  **  »y(j )) »  c  l»*..,n)  defines  n  points, 

of  which  n  are  known  to-be  "occupied,"  that  is,  to  describe 


an  event .  For  x(i ) » 


there  exists  some  J  such  that  y 


(J) 


gives  the  y-coordinate  value  for  the  event  which  gave  rise 


to  xq)*  Similarly,  for  x^2)  there  are 

now  n-1  J’s  remaining,  one  of  which  must  correspond  to  the 


event  giving  rise  to  Continuing  to  x(n)>  there  can 

only  be  one  J  left  to  be  associated  with  the  last  x-value. 
Thus  there  are  n!  combinations  of  (x,y)^,  i  *  l,...,n 
each  having  density  of  l/(X*Y*)n  and  so  the  ordered  bivariate 


uniform  density  is  established  as  that  stated  above. 
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